Functional Information in SWISS-PROT: the Basis for Large-scale Characterisation of Protein Sequences

Author

  • Rolf Apweiler
Abstract

With the rapid growth of sequence databases, there is an increasing need for reliable functional characterisation and annotation of newly predicted proteins. To cope with such large data volumes, faster and more effective means of protein sequence characterisation and annotation are required. One promising approach is automatic large-scale functional characterisation and annotation, which is generated with limited human interaction. However, such an approach is heavily dependent on reliable data sources. The SWISS-PROT protein sequence database plays an essential role here owing to its high level of functional information.

Similar resources

Protein Sequence Annotation in the Genome Era: The Annotation Concept of SWISS-PROT + TREMBL

SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and a high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporati...

Inferring sub-cellular localization through automated lexical analysis

MOTIVATION: The SWISS-PROT sequence database contains keywords of functional annotations for many proteins. In contrast, information about the sub-cellular localization is available for only a few proteins. Experts can often infer localization from keywords describing protein function. We developed LOCkey, a fully automated method for lexical analysis of SWISS-PROT keywords that assigns sub-cell...

Organisation and standardisation of information in SWISS-PROT and TrEMBL

SWISS-PROT is a curated, non-redundant protein sequence database which provides a high level of annotation and is integrated with a large number of other biological databases. It is supplemented by TrEMBL, a computer-annotated database which contains translations of all coding sequences in the EMBL Nucleotide Sequence Database which are not yet in SWISS-PROT. Each fully curated SWISS-PROT entry ...

Neighborhood-Based Label Propagation in Large Protein Graphs

Understanding protein function is one of the keys to understanding life at the molecular level. It is also important in several scenarios including human disease and drug discovery (1). In this age of rapid and affordable biological sequencing, the number of sequences accumulating in databases is rising at an increasing rate (2). This presents many challenges for biologists and computer scien...

The Gene Ontology Annotation (GOA) Project—Application of GO in SWISS-PROT, TrEMBL and InterPro

As proteomics research gains momentum, biologists need new ways to access and analyse information on proteins. Many new gene products, from a wide range of species, are being added to the SWISS-PROT Protein Knowledgebase, the world’s most highly annotated protein sequence database, and its supplement, TrEMBL [3]. To fully exploit the potential of these data, the SWISS-PROT group at EBI aims to...

Journal:
  • Briefings in bioinformatics

Volume 2, Issue 1

Pages -

Publication date 2001